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Abstract. - 

We analyze a database comprising quarterly sales of 55624 pharmaceutical products com- 
mercialized by 3939 pharmaceutical firms in the period 1992-200f . We study the probability 
density function (PDF) of growth in firms and product sales and find that the width of the 
PDF of growth decays with the sales as a power law with exponent /3 = 0.20 ± 0.01. We also 
find that the average sales of products scales with the firm sales as a power law with exponent 
a — 0.57 ± 0.02. And that the average number products of a firm scales with the firm sales as 
a power law with exponent 7 = 0.42 ± 0.02. We compare these findings with the predictions of 
models proposed till date on growth of business firms. 



In economics there are unsolved problems that involve interactions among a large number 
of subunits [1-3]. One of these problems is the structure of a business firm and its growth [2,4]. 
As in many physical models, decomposition of a firm into its constituent parts is an appropriate 
starting place for constructing a model. Indeed, the total sales of a firm is comprised of a 
large number of product sales. Previously accurate data on the "microscopic" product sales 
have been unavailable, and hence it has been impossible to test the predictions of various 
models. Here we analyze a new database, the Pharmaceutical Industry Database (PHID) 
which records quarterly sales figures of 55624 pharmaceutical products commercialized by 
3939 firms in the European Union and North America from September 1991 to June 2001. 
We shall see that these data support the predictions of a simple model, and at the same time 
the data do not support the microscopic assumptions of that model. In this sense, the model 
has the same status as many statistical physics models, in that the predictions can be in accord 
with data even though the details of microscopic interactions are not. The assumptions of this 
simple model given in Ref [5] are as follows: (i) Firms tends to organize itself into multiple 
divisions once they attain a particular size, (ii) The minimum size of firms in a particular 
economy comes from a broad distribution, (iii) Growth rates of divisions are independent of 
each other and there is no temporal correlation in their growth. With these assumptions the 
model builds a diversified multi-divisional structure. Starting from a single product evolving 
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to a multi-product firm, this model reproduces a number of empirical observations and make 
some predictions which we discuss in detail below along with results and predictions of other 
models which attempt to address the problem of business firm growth. 

Consider a firm i of sales Si with Ni products whose sales are £ij where j — 1, 2, TVj. 
Thus the firm size in terms of the sales of its products is given as Si — ^jZi' ■ The growth 
rate is 

9i (t) = log ( ^g^ ) = l°gS(t + At) - logSi(t), (1) 

where Si(t) and Si(t + At) are the sales, in units of 10 3 British Pounds, of firm i being 
considered in the year t and t + At, respectively. Pharmaceutical data has seasonal effect, 
and hence the analysis of quarterly data will have effects due to seasonality. To remove any 
seasonality, that might be present, we analyze the annual data instead of the quarterly data. 

Recent studies have demonstrated power- law scaling in economic systems [6] . In particular 
the standard deviation a of the growth rates of diverse systems including firm sales [6] or 
gross domestic product (GDP) of countries [7] scales as a power-law of S. 

The models of Rcfs [1-3,5,8] all predicts that standard deviation of growth rates amongst 
all firms with the same sales scales as a power law a(g\S) ~ S'-' 3 . Further, model of Refs [5] 
predicts that probability density function PDF p(g\S), of growth rates for a size of firm S 
scales as a function of S as : 

p(q\s) ~ ^ f (^y (2) 

where /o is a symmetric function of a specific "tent-shaped" form resulting from a convolution 
a log normal distributions and a Gaussian distribution, with parameters dependent on the 
parameters of the model. Figure la plots the scaling of the standard deviation a(g\S). We 
observe cr(g\S) ~ S~^ with /3 = 0.19 ± 0.01. Figure lb plots the scaled PDF as given by eq. 2 
for three sales groups; small (S < 10 2 ), medium (10 2 < S < 10 4 ) and large (10 4 < S). The 
figure also plots /o as predicted by refs [5]. The model of Ref [5] further predicts that the 
PDF of the product size £ for a fixed firm size S, pi(£\S) should scale as 

Pittl*)-^ • (3) 

where again f\ depends on the parameters of the model. According to the model discussed 
in Refs [5,8] /i is a log-normal PDF. We then evaluate the average product size E(£\S) in a 
firm of size S, defined as E(£\S) = J d£pi(£\S)£ ~ S a . Figure 2b plots E(£\S), we observe 
E(£\S) ~ S a with a = 0.57 ±0.02. Figure 2b plots the scaled PDF as given by eq. 3 for three 
sales groups; small (S < 10 2 ), medium (10 2 < S < 10 4 ) and large (10 4 < S). We observe that 
for each of the groups the PDF pi(£,\S) is consistent with a log-normal distribution by noting 
in a log- log plot the PDF pi(^\S) is parabolic which is tested by performing a regression fit. 

According to Ref [5], the PDF p 2 (N\S) of number of products N in a firm of size S should 
obey the scaling relation: 

MN\S) ~ £ f 2 (£) • (4) 

where the function / 2 is log-normal and depends of the parameters of the model. We evaluate 
the average number of products E(N\S) for a firm of size S. Using eq. 4 we note that 
E(N\S) = J dNp 1 (N\S)N - ST Figure 3a plots the expectation E(N\S) and we observe 
that E(N\S) - 5* 7 with 7 = 0.42 ± 0.01. Figure 3b plots the scaled PDF p 2 (N\S) as given by 
eq. 4 for three groups; small (5 < 10 2 ), medium (10 2 < S < 10 4 ) and large (10 4 < S) . 
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Fig. 1 - (a) Firms are divided into 10 groups according to firm sale S. We find the standard deviation 
a(g\S) of the growth rates scales as a power law, a(g\S) ~ S' 13 with /3 = 0.20 ± 0.01. Symbols 
are data points and solid line is a regression fit. (b) PDF of the growth rates for small [S < 10 2 ], 
medium [10 2 < S < 10 4 ], and large [10 4 < S] values of S is scaled by their standard deviation. 
Note the collapse of the histograms of the three groups which confirms the scaling exponent f3. The 
dashed line is /o as predicted by the model approximating the results of ref [5] given by fo{x) ~ 
ao exp{(— ai (1 + 0.75 ln(l + ai x 2 )]}[l + 0.75 ln(l + ai x 2 )] 1 ^ 2 where ao, ai,and ai are parameters 
of the model. 

According to [5] the relations between the scaling exponents a ft, and 7 are given by 

7 = 1 - a, (5) 

» - ^ < 6 ' 

which we find to be approximately valid for the PHID database. 

According to the model discussed in refs [5], the distribution of product sizes in each 
firm scales with the product size at launch 5 a t launch, according to, po^/Sat launch) which is 
approximately log-normal. Model [5] postulates that the PDF of 5 a tiaunch is log normal, i.e., 
P(log S^t launch) is Gaussian with variance W p and each firm is characterized by a fixed value 
of S at launch- Furthermore, ref. [5] predicts that the distribution of firm sales is close to log 
normal i.e., the PDF P(log5) is Gaussian with variance Wf. With these hypothesis ref. [5] 
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(a) 




Firm sales, S 




Fig. 2 - (a) Mean, E(£ \S) of the product sale conditioned for a firm of sale S. We observe that 
the mean scales as E(£\S) ~ S a with a — 0.57 ± 0.02. Symbols are data points and solid line is 
a regression fit. (b) PDF of the product sales for small [S < 10 2 ], medium [10 2 < S < 10 4 ], and 
large [10 4 < S] values of S scaled by S "' 7 . Note the collapse of the PDF's of the three groups which 
confirms the scaling exponent a. 



derives that, 

0= W f- W " (7) 
P 2W f [! 

Figure 4 plots PDF of annual products sales, products sales at launch P(logS , at launch), and 
firm sales P(logS') between 1990-2001. The variance of the PDF's of products sales at launch 
and firm sales are estimated to be W p = 0.88, Wf = 1.72 respectively. This gives (3 = 
(Wf — W p )/2 Wf = 0.24 which is approximately what is observed empirically. We employ 
two methods to estimate W (the standard deviation) : (i) Estimate W from the definition, 
i.e. W 2 — (1/(N — 1)) Y^i=i ( x i~ < x >) 2 where {x\, x 2 , ...xn} is a set of data and < x > 
is the mean of the set {xi}. (ii) First estimate the PDF from the set {xi}, then perform a 
regression fit with a log-normal function to the PDF. The standard deviation W will be one of 
the fitting parameter. Hence estimate W from the estimated parameter value from the least 
square log-normal fit to the PDF. We observe both this method gives similar values of W and 
the ratio (3 (cf. eq.7 ) remains unchanged as long as we consistently use one of the 2 methods 
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Firm sales, S 




Fig. 3 - (a) Mean, E(N\S) of the number of products N for a firm of sale S. Symbols are data 
points and solid line is a regression fit. We observe that the mean scales as E(N\S) ~ S 1 with 
7 = 0.42 ± O.Of.(b) PDF of the number of product for small [S < 10 2 ], medium [10 2 < S < 10% 
and large [10 4 < S] values of S scaled by S ' 42 . Note the partial collapse of the PDF's for of the 
three groups which confirms the scaling exponent 7. For small values of N, which also corresponds to 
small values of S, the statistics become poor. This statistical errors gets even more amplified when 
we divide small values of N by S* ' 42 . Thus we observe poor quality of data collapse for N/S ' 42 < 1. 
The data collapse is better for N/S 42 > 1 where we have good statistics. 



described above. Our estimate of W presented here is using the former method. 

Ref. [5] postulates products growth rate to be Gaussian and temporally uncorrelated. To 
test this postulate figure 5 plots the PDF P(g) of the growth g of the products At = 1 
and At = 10 year [9]. We see that the empirical distribution is not growing via random 
multiplicative process as ref. [5] postulates but has the same tent shape distribution as the 
distribution of firm sales growth rate, suggesting that the products themselves may not be 
elementary units but maybe comprised of smaller interacting subunits. Figure 5 also plots 
PDF P(g') surrogate data obtained by summation of the 10 annual growth rates from the 
empirical distribution. We observe that the P(gAt=io) f° r products differs from the surrogate 
data implying there are significant anti-correlation in the growth dynamics between successive 
years. 
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Sales, log S 



Fig. 4 - PDF of products sales (diamond) firm sales (circles) between f990-200L The variance of 
the PDF's of products sales at launch and firm sales are estimated to be W v = 0.88 and Wj = 1.72 
respectively. This gives Wj — W p = 0.84 and (3 = 0.24 [cf. eq. 7] which is approximately what is 
observed empirically as predicted by [5]. 



In summary we study the statistical properties of the internal structure of a firm and 
its growth. We identify three scaling exponents relating the (i) sales of the products, £ (ii) 
the number of products, N and (iii) the standard deviation of the growth rates, tr, of a firm 
with its sales S. Our analysis confirms the features predicted in ref [5]. However we find 
that the postulate of the model namely: the growth rate of the products is uncorrelated and 
Gaussian is not accurate. Thus the model of ref. [5] can be regarded as a first step towards 
the explanation of the dynamics of the firm growth. 

We thank L. A. N. Amaral, S. Havlin for helpful discussions and suggestions and NSF and 
Merck Foundation (EPRIS Program) for financial support. 




Fig. 5 - Probability density function (PDF), P(gAt) of products for At = 1 year (solid line) and 
At = 10 years (dashed line). Circles represent the of surrogate data [9]. In absence of correlation we 
expect the data for At = 10 to coincide with the PDF of the surrogate points. 
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